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ABSTRACT 


The object of this study is to find an approximation to the discrete 
optimal Kalman filter gain schedule by closed-form analytic expressions. 
In doing so, required table storage and/or on-line computation time can 
be reduced at little expense in terms of filter performance degradation. 
The method of least squares was used to determine the closed-form solution 
which was the best fit to the discrete Kalman filter gain schedule. The 
criterion for performance degradation was the difference between the val- 
ues of the diagonal elements of the estimation covariance matrix, Pie 
obtained by using the Kalman gain schedule, and the corresponding values 
obtained by using the closed-form analytic expressions for the elements 
of' the gain matrix. Examples are presented to show that near-optimal 
results were obtained utilizing this method. A comparison of the results 


of this study with another near-optimal estimation scheme is also included. 
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I. INTRODUCTION 


The theory of optimal estimation has undergone considerable investi- 


gation in recent ears Renn nee | . The objective of estimation is the 
production, in some optimal way, of estimates of the state vector, which 
In the 


is denoted by x, , from some set of observations, denoted by 


k =x" 
early 1960's, R. E. Kalman initiated a new formulation of the Wiener 
filter theory expressing the results in the time domain rather than the 


frequency domain [6.7.8] . 


The vector dynamic case of the linear filtering equation can be 


stated by 
A A 
= + G - 1 
ye esi ee Oe - 
where Xo is the estimate of x. Kalman found that the optimal estimate 
A 


Ky? in a minimum-variance sense, could be obtained by solving the dif- 


ference equations 


“ T i a 
maar area i R| (2) 
Pa = (I-GH)P) 4,4 (3) 
- T 
Pee ee +e © 


with PO /-1 —- VAR [ ¥o — Xo| and x) = EF [Xo} The definitions and dimen- 


sions of the matrix quantities in the above equations may be found in the 
Table of Symbols and Abbreviations. 

. ; : , A 

It is obvious that, in order to solve for successive values of Ke 


in equation (1), it is necessary to solve equations (2), (3), and (4) 


recursively. This process involves a considerable number of matrix mani- 
pulations. Consequently the on-line computation time would be large. Also 
considerable storage space would be needed if the problem were to be redone 


at a later time, utilizing the same matrix of gains. This estimation 


n + n(nel) 
Z 


are non-linear. For large-order 


scheme requires the implementation of 


n(n+1) 
Z 


difference equations 
Or an a order system of which 
systems, this scheme may be of such complexity that it is undesirable for 
a given application. 

If it were possible to derive an analytic closed-form expression which 
would approximate the optimal Kalman filter gain schedule with acceptable 
degradation of filter performance then on-line computation time would be 
greatly decreased since no matrix manipulations would be necessary. Also, 
the need for table storage of the gains, Gy» would not be necessary since 


it would be a simple matter to calculate them again when they were needed. 


Ii vee THEORY 


Appendix A illustrates that, according to the criterion of least 
Squares, the best fit curve to a set of data is one in which the sum of 
the squares of the errors of the points is a minimum. The method used to 
obtain this best fit curve is known as the Method of Least Squares. 

If this method can be used to produce a curve which is the best fit, 
in a least-squares sense, to the set of optimal gains, then the equation 
for that curve would be a sub-optimal closed-form solution for those 
gains. 

Therefore, all that is necessary to produce a candidate closed-form 
analytical approximation for the optimal Kalman filter gain schedule is 
to fit a least squares curve to the set of optimal gains. 

Observations of typical optimal Kalman filter gain schedules indi- 
cated that these gains would most likely be best fit with exponential 
curves. In order to take advantage of existing computer programs on least- 
Squares curve-fitting it is necessary to convert the exponential form to 
a linear form. Appendix A illustrates the method by which this transfor- 
Mation was made. Once the transformation to a linear form has been made 
the method of least-squares curve-fitting can be applied and the results, 
through an inverse transformation, can be used to obtain the desired ex- 
ponential expressions. 

Equations (2), (3), and (4) reveal that the solution for the optimal 


gain matrix depends on the quantities P HS ORs (} and Q where 


k/k-1? 


g= pyri (5) 
and N is the random force covariance associated with the particular prob- 


lem. The definitions and dimensions of these quantities may be found in 


the Table of Symbols and Abbreviations. In order to reduce the complexity 


of the system, for that is the basic goal of this study, some simplifying 


assumptions are made. [t is necessary to initialize the prediction co- 
variance matrix PskeL’ The diagonal elements of this matrix indicate 
the confidence given to the initial filter state vector. It will be as- 
sumed that little confidence is given, thus the diagonal elements of 
Po/-1 will be large. The off-diagonal elements indicate to what extent 
the initialization errors of the filter states are dependent. It will be 
assumed that they are independent and thus the off-diagonal terms of 
gyal will be zero. 

It is also assumed that the measurement matrix, H, the state trans- 
mission matrix, Q, and the [ matrix, which is used to compute the co- 
variance of perturbation matrix, Q, are constant for a given plant. That 
is to say that once these quantities are known for a specific process or 
plant, they remain unchanged during the operation of the plant or process. 

Thus, once the nature of the plant or process is known, the only 
variables involved in the determination of the optimal gain matrix are the 
measurement noise variance, R, and the random force variance, N. These 
quantities will be assumed constant in time for a given problem, but may 
vary from problem to problem. 

With these assumptions it is possible to develop closed-form expres- 
sions for the elements of the gain matrix in terms of the quantities R 


and N. 
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TIT. PROCEDURE 


As stated above the object of this study is to develop a method for 
finding a sub-optimal closed-form analytic expression for the optimal 
Kalman filter gain schedule which would be easier to implement and reduce 
—ee table storage and/or on-line computation time while keeping fil- 
ter degradation to a minimum. 

The best way to present the method of this study is to follow an 
illustrative example which demonstrates the technique used to obtain the 
desired results. Note that this method is designed to be used with a 
plant or process which has a known, non-variable configuration, with the 
only variables being the measurement noise variance, R, and the random 
force variance, N. 

The problem discussed for purposes of demonstration will be one where 
the motion of a mass particle in one dimension is considered. The ob- 
server will initialize the problem by indicating when he sees the particle 
pass a zero point. Subsequent measurements of the particle's position 
will be made at a period of T seconds as measured by a clock. The filter 
then must produce minimum variance estimates of the particle's position 
and velocity at discrete times kT where k 2 0. 

The system state variables chosen were position and velocity. The 


vector matrix form of the state equations can be expressed as 


iE deft): ‘ 


Assuming that the random forcing function, f, is piecewise constant, and 
recalling the definitions of §(T) and [(T) given in the Table of Symbols 


and Abbreviations, the solution of equation (6) is given by 


11 


ee +17 Qx, + r pit | (7) 


ly 
Bg “ 


r 
1/M } 2 (9) 
t 1% 


where 


§(T) 


and 


Ge), 


Since only measurements of position are made the measurement matrix is 


given by 
5 ae | gti (Oe ee (10) 


From equation (5) the covariance of perturbation matrix, Q, is given by 


Di (11) 


Next the problem must be initialized. Assuming that the observer 
Signals when he thinks that the particle is initially at a position zero 
and also assuming his guess of the particle's velocity to be 10.0 units/ 


sec., the filter's initial state vector is 


A «| ono 
“0/-1 — 2 


As discussed above, when initializing the prediction covariance matrix 
it is necessary to consider how much confidence is given to the values of 
the observer's eye-sight and reflexes as well as his ability to estimate 
the particle's initial velocity. Being distrustful of his ability the 


initial covariance matrix becomes 


di 


100 Q 


The diagonal elements indicate the lack of confidence in the initialization 
of the state vector of equation (12). 

Assuming a value of unity for the mass of the particle, values of R 
and N may be chosen and the optimal gain schedule for the Kalman filter 
may be found using equations (2) through (4). 

The problem to be considered now is how to achieve a closed-form 
analytic approximation for G, so that equation (1) may be solved to ob- 
tain estimates of x. 

Before going further a quantity to be known as the pseudo signal-to- 


noise ratio of the system must be defined. This signal-to-noise ratio 


is defined as N/R where 


2 T 
N=E E wf | (14) 
and 


(15) 


ye] 
1) 
es 
I< 
i 
Is, 
boss 
@ 


To make this signal-to-noise ratio a scalar quantity it must be stipu- 


lated that the random measurement noise signal, v., and the random forc- 


k? 
ing function, ie are scalar quantities for this problem. Thus the quan- 
tity N/R is a scalar measurement of the signal-to-noise ratio of the 
system. 

The example problem, with initial conditions as shown above, was run 
for various values of the signal-to-noise ratio N/R. Fig. 1 and Fig. 2 
show a family of curves for the gain matrix elements G(1,1) and G(2,1) 


repsectively and the associated signal-to-noise ratio of each. As stated 


earlier it was decided that the points of G(1,1) could best be approxi- 


mated by the sum or difference of two exponentials. Using the method 
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illustrated in Appendix A of reducing an exponential to linear form it 
was a simple matter to take the logarithms of the points and obtain a least- 
Squares fit curve for each set of points. The curves obtained as well as 
the equations for these curves are found in Fig. 1 and Fig. 2. 
Steady-state gain as a function of signal-to-noise ratio is plotted 
in Fig. 3. These plots indicate that the steady-state gains also behave 
in an exponential manner with respect to the signal-to-noise ratio. Fur- 
ther investigation showed that the exponents of the equations in Fig. l 
and Fig. 2 behave in a like manner. Using the same method as discussed 
above, closed-form solutions for these quantities were developed in terms 
of the signal-to-noise ratio N/R. The analytical closed-form solution 
and its related equations are called the sub-optimal equations. These 


appear in equations (4.1) through (4.9) of Appendix B. 
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IV. RESULTS 


Once the closed-form analytic approximation for the gain matrix, Gh 
is found it is an easy matter to test the results against the recursive 
equations of the optimal Kalman filter. Since the Kalman filter gains are 
optimum in a minimum-variance sense it is necessary only to compare the 


diagonal elements of the P matrix obtained using the closed-form sol- 


k/k 
ution for Gh with the respective elements of the Pe Matrix obtained 
using the optimal Kalman filter gain schedule. 

Since the gains obtained from the solution of the closed-form equa- 
tions are non-optimal, equation (3) may not be used. Instead it is ne- 
cessary to derive a general equation for all values of Pik obtained with 


other than optimal gains. Such an equation has been der iveducl which can 


be expressed as 


iG 


2 flvaye 1 
Ie G) HP HG, + G CHP) H+ R)G (16) 


ge e/a. k/k-1 


Figures 4 and 5 show the comparison of the diagonal elements of Pile 
for the optimal case and those for the case where Gy was determined by 
the closed-form expressions developed. These figures indicate that the 
values obtained using sub-optimal gains are higher than the optimal val- 
ues. To determine how much this difference affects the estimation capa- 
bilities of the filter this problem was simulated and a comparison was 
made between the actual value of velocity for the point and the estimated 
value of velocity for both the optimal and sub-optimal case. Figure 6 


indicates that, for this problem, the estimating capabilities of the 


optimal and sub-optimal filters are similarly effective. 
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V. COMPARISON OF RESULTS WITH A PREVIOUS STUDY 


Sims and Mails at nage reported on attempts to achieve, for the con- 
tinuous case, near-optimal estimation uSing a structure which is easier 
to implement than the optimal Kalman solution. Their method, called 
Specific Optimal Estimation, is described in Ref. 13. The following is 
an attempt to adapt their methods and results for the continuous case to 


the discrete case. 


A. THE DISCRETE LINEAR PROBLEM 


Consider the linear message model 


- “ 
ary gor LS 1 
with observation model 
Zioe Hx, ee, (18) 


where u and v. are white noise processes such that 


E fo, of] = 
fe = 


: i, - aa (19) 
fa) =e] °° 


: : ae : ; A 
The optimal linear minimum variance estimate, x » of the state 


~K/k 
vector xX, is given by 
A A 
= + G = 20 
ZK/k =k/k-1 7k E 4-1 oul 


ps 


where 


G =P H’ | HP ‘Utne pills (21 
k k/k-1 k/k-1 ) 
and Pisk is the solution to the matrix difference equations 
= a 
Peet /k ~ Pah + 9 ce) 
A A 
and Po/-l VAR EX, | : Xp = Eel 


It is proposed in this paper, to find an estimate of x, having 


A 
% 

—k/k k 

the same form as equation (20), but with the gain matrix, Gps constrained 


to be of a specific configuration. In particular, G. is assumed to be 


the solution to the difference equation 


Gy = F(k,a) Go = D (25 


where a is a constant-parameter vector to be determined optimally and D 
is a matrix of initial conditions which may or may not be determined opti- 
mally. 


Let x. 


A . , 
te a: be the estimation error, and assume that the obser- 


vation interval be discrete. The error criterion used is 


fi 
= eel fy 7] (5 


k 


This may be written as 


it 
eur € e[ ex] (26) 
1 ; 


Making use of the definition 


- ~ oT 
a a e[ 2 | | (27) 


Z3 


equation (26) can be written 


P (28) 


C, 
il 
ct 
MH 

Il AA ke 


l k/k 


The problem then is to determine the parameters in the equation of G) 
such that J is minimized. If it is assumed that the estimate, Xy gs is 

of the form specified in equation (20) it is easy to verify that Pik satis- 
fies the difference equation 


Z = lye z a 
Pei = Pe/k-1 GHP) yt Gy oe CG, CHP) 1H + R)G). | (29) 


Sims and Melsa determined the parameters in the equation for Gy which 
minimized equation (28) by developing the Hamiltonian for Pres: deter- 
mining the equations for the Lagrange multiplier matrices, and obtaining 
the solution by solving the problem as a two-point boundary problem with 
appropriate boundary conditions. 

Since the equation for ee is a difference equation, in the discrete 
case, rather than a differential equation, this approach cannot be used. 
One might solve equation (29) recursively with equation (4) for all val- 
ues of k in the summation, and set the derivative with respect to the 
undetermined parameters of Gy equal to zero to determine the gains which 
would minimize equation (28), but an inspection of this method by the 
author revealed that this would be too tedious. The approach adopted 
here, although being equally tedious, lent itself to a computer solution. 
This approach involved setting the gain matrix to a constant in equation 
(29) and evaluating the cost function of equation (28). One element of 
the gain matrix was held constant and the other was varied until a mini- 
mum value for J was found. The element of the gain matrix which had 


been held constant was then incremented and the procedure was duplicated. 


As a result Fig. 9 was obtained. The verticle lines are proportional to 
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the value of the cost function at each local minimum shown. The coordi- 
nates of the base of each verticle line are the values of G(1,1) and G(2,1) 
which gave the value of the cost function shown. The values of gain at 
which the cost function reached its absolute minimum were G(1,1) = .852 

and G(2,1) = .466. These are compared with the optimal steady-state val- 
ues of G(1,1) = .700 and G(2,1) = .400. Using the values of gain obtained 
above the illustrative problem was run and the results are presented in 
graphic form in Fig. 4 and Fig. 5. 

Note that the gains obtained above were close to the steady-state 
values of the optimal gain matrix. It would be expected that these gains 
would become closer to the optimal steady-state values as the number of 
samples increases. To show this the solution was performed again for ten 
samples, vice twenty. The values of the gain matrix for minimum cost ob- 
tained were G(1,1) = .902 and G(2,1) = .516. Thus, with a smaller number 
of samples, the sub-optimal constant gains are further from the steady- 


state optimal gains. 
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VI. CONCLUSIONS 


It is evident from Fig. 4 through Fig. 6 that the Kalman-type filter 
utilizing the closed-form equations for sub-optimal gains performs in a 
near-optimal manner with little degradation in the performance of the 
filter. The question now arises as to whether this method of determining 
the filter gains does fulfill the objective of this study; that is, has 
required table storage and/or on-line computation time been reduced? A 
comparison of equations (4.1) and (4.2) of Appendix B with equation (2) 
above indicates that the former equations are easily and quickly solved 
for all values of k, once the signal-to-noise ratio, N/R, is determined, 
while in the latter equation it 1s necessary to perform four matrix pro- 
ducts, two transposes, and a matrix inversion in order to solve for one 
value of the gain matrix. In order to solve this equation for all values 
of k it is necessary to solve equations (3) and (4) recursively along with 
equation (2). This procedure requires four additional matrix products, 
an additional matrix transpose, and a matrix addition and subtraction in 
order to obtain each additional value for the gain matrix. Thus the 
closed-form gain equations will greatly reduce on-line computation time. 

This same argument can be used to show that required table storage 
has also been reduced. Assume, for instance, that a problem is to be 
performed over again at a later time. In the case of the Kalman filter 
it is necessary to store the gain schedule, since it would be too costly 
in on-line computation time to re-compute the schedule. Since the closed- 
form equations are much more quickly solved it 1s not necessary to store 


the values of gain obtained in this manner. 


pe | 


Recall that it has been assumed that all factors which influence the 
gain matrix are to be constant from problem to problem, with the exception 
of the scalar measurement-noise covariance, R, and the scalar random-force 
covariance, N. These quantities were assumed constant for a single prob- 
lem but could change from problem to problem. Suppose now that one or 
both of these quantities changed during the problem, say at time k equal 
to ten. The optimal gain schedule for such a system might be as shown in 
Fig. 7. In a situation such as this the Kalman gain schedule would have 
to be computed on-line to maintain optimality if the change time were not 
known a priori. In Fig. 8 a plot is shown of the reaction of the closed- 
form approximations to such a situation. Although the sub-optimal gains 
are still non-optimal, they follow the general trend of the optimal gains, 
and hence may be used in their stead. 

The arguments presented above would indicate that substantial savings 
in time and space could be realized at little cost in performance if the 
closed-form approximations for the gain matrix were used in place of the 
Kalman recursive solution. 

An inspection of Fig. 4 and Fig. 5 reveals that the results obtained 
by the specific optimal estimation technique offered by Sims and Melsa 
are more near-optimal than those obtained by the method discussed in this 
paper. This fact brings out the major difference between these two methods. 
In this study the gain matrix was constrained to be an exponential, the 
parameters of which were determined by geometric means. That is, the 
parameters of the exponential were determined by requiring that the ex- 
ponential be a geometric least-squares fit to the Kalman filter optimal- 
gain matrix. In contrast the parameters of the constant-gain matrix of 


the specific optimal estimator were determined by minimizing errors in 
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filter performance. It is significant to note that the constant gains 
employed in the Sims/Melsa method gave better performance than the ex- 
ponential gains of the geometric fit method. One might speculate that 
assumption of an exponential form in the former method might even more 


closely match the optimal filter performance. 
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VII. SUMMARY 


This study has illustrated that a closed-form analytic approximation 
to the filter gain matrix for an optimal Kalman filter can be obtained 
which will reduce the computation time while procuding near-optimal esti- 
mates. This method would be useful in problems where computation time and 
storage space were limited. This method has compared favorably with both 


the optimal filter and the specific optimal estimator with which it was 


compared. 


3d. 


APPENDIX A 


METHOD OF LEAST SQUARES 


The most probable value of a quantity which is obtained from a set of 
observations is the one which corresponds to the most probable set of 
errors of observation. These errors are known as residuals. Consider 
a set of n observations of equal precision, the mose probable errors of 
which are Xp» XQ. Xg0+--X> respectively. Since the probability of the 
simultaneous occurrence of several events in a series is the product of 
their individual probabilities, and the probability of an error, x, is 

h Ped 
pipe (2 aly 
it follows that the probability of the simultaneous occurrence of the 


errors X1, X92; XZo00eX will be 


Tp 2a caw: 
r [pet] et “| a : 


Oa 2 2 
Pp = h - anh’ (x* x xX ae (2.2) 
{tr i, <2 eee ; 


that is 


Since h, n, , and e are constants in a given problem the quantity P is 
a maximum when the exponent of e is a maximum, or when the sum 
2 ae fare 
Mr 55 Ge Sek is a minimum. (223) 
i 2s n 


Thus, the most probably value of the observed quantity, or the best value, 
in other words, obtainable from the given set of observations, will be one 
for which the sum of the squares of the residuals is a minimum. This is 


called the Principle of Least Squares. 


3 


The Principle of Least Squares has been used extensively to formulate 
computer programs for determining the coefficients of a linear equation 
which is a best fit, in the Least-Square sense, to observed data. In 
general, equations of higher than first degree can be reduced to linear 
form by developing the function by Taylor's Theorem and neglecting the 
Squares, products, and higher powers of the small increments involved. 

Equations in which the unknown constant occurs as an exponent con- 
stitute a special case of reduction to linear form. In brief, the method 
is to throw the equation into the logarithmic form by taking the logarithm 
of each member. The resulting function will be linear with respect to 


the desired coefficient. Suppose the function is of the form 
y= ae?t (2.4) 


in which a and b are to be determined so as to fit all to the observa- 


tions as well as possible. Taking the logarithm of each member gives 
log y = log a + bt, (25) 
which is the linear form of 
y' =A+ bt, (2.6) 


where A and b are the unknown constants. By simply taking the inverse 
logarithm of the coefficient A the Least Squares fit curve of exponential 


form can be obtained. 


36 


APPENDIX B 


SUB-OPTIMAL GAIN EQUATIONS 
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